A DUALITY THEORY FOR SOME NON-CONVEX FUNCTIONS 

OF MATRICES. 
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Abstract. We study a special class of non-convex functions which appear 
in nonlinear elasticity, and we prove that they have a well-defined Legendre 
transform. Several examples are given, and an application to a nonlinear 
eigenvalue problem 



1. Introduction 

We want to define a Legendre transform (y) for functfons F (x), where x is 
an X if matrix, and F involves the various cofactors of x. Note that F tlien 
has to be strongly nonlinear, and nonconvex. The simplest case is when N = K 
and F is a function of the determinant only: F (x) = $ (det a;). We show that the 
Legendre transform of 

F(x) = -\detx\''/'' 
P 

is 

F^ (2/) = -|dety|'^/^ 

q 

with 1/p + 1/q = 1, thereby generalizing the classical duality between spaces. 
The next simplest is when F is a function of the {N — l)-cofactors of x: in the case 
when N = K ~ 3, we give conditions under which F has a well-defined Legendre 
transform. This covers for instance the area functional, defined over 2x3 matrices 
X = (x)) by: 



1/2 



which turns out to be its own Legendre transform (in other words, it is self-dual). 

Note that with our definition, the Legendre transform F^ of F satisfies the usual 
duality relations: 

{F'^f ^F 

The paper is organized as follows. First we define precisely what we mean by a 
Legendre transform. Then we sudy two polar cases. In the first one F is a function 
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of the determinant only, and in the second F depends only on the 2x2 eofactors of 
a 3 X 3 matrix. We give seeveral examples, and we conclude by giving an application 
to a nonlinear eigenvalue problem. 

2. The Legendre transform. 

Let X be a finite-dimensional vector space, and Y its dual, the duality pairing 
being denoted hy < x,y >. Let F : X ^ R he a function and F' (x) G Y its 
derivative at x. The classical formula of Legendre associates with every y Y a 
set Tl {y) C R defined as follows: 

(2.1) TL{y) = {<x,y>-F{x)\y^F'{x)} 

Usually, the right-hand contains several points, so that Fl is a multi- valued 
map from Y to R. We refer to HI for a study of this map. For certains classes 
of functions F, however, the right-hand side is a singleton, so that formula (|2.1|) 
defines a function on Y, which is then called the Legendre transform of F. 

Definition 1. Consider a map F : Q ~> R, where D, is a submanifold of X , and set 
T, ~ F' (fl). We shall say that a function G : S ^ i? is the Legendre transform 
of F if Y, is a submanifold of Y and: 

[xen, y = F' {x)]=^ < x,y > -F {x) = G{y) 
Of course, if F' is one-to-one, this formula becomes: 

G (y) =< {F')-' {y).y> -F ([F'y' {y)) Vy e [F')-' m 

We shall denote the Legendre transform of F by F^, so that F^ = G in the above. 
It follows from the general theory of the Legendre transform (see that F'and 
G' are inverse of each other and that F (x) is the Legendre transform of (y). In 
other words, we have the classic formulas: 

{F'^f^F 
(FT' = [F')' 

There is a well-defined theory of Legendre transform for convex functions. In 
that case, Q, = X and the Legendre formula is replaced by the Fenchel formula: 

F* (y) = sup{yx - F (x)} 

X 

so that differentiability is no longer required. The Fenchel transform F* will co- 
incide with the Legendre transform F^ provided F is and strictly convex. We 
now proceed to give other classes of functions which have a well-defined Legendre 
transform. Roughly speaking, these will be functions F(x), where a; is a matrix 
and F depends only on the eofactors of x. 

Given a number K and some k < K, we shall denote by Pk the set of strictly 
increasing maps of {1, k} into {1, K}: 

Pk = {7t: {1, k} {1, ...,K} I TT (1) < ... < 7r(fc)} 

and by c {K, k) = its cardinal. We can also think of Vk as being the set of 
ordered subsets of {1, ■■■,K} with k elements. Similar notations will hold for Vn 
and c (fc, N), provided k < N. 

Consider a fixed A^-dimensional space E. An element x G E* will have coordi- 
nates Xn, 1 < n < N . Given a family of K linear forms, [x^ , ...,x^), and a number 
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k < imn{K, N}, there are c{K,k) x c{N,k) square k x k matrices which can be 
extracted from the matrix x^. Each of them is specified by a certain choice of k 
fines and k columns, that is by some tt G Pk and some a € Vk- We shafi denote 
it by: 

l<i<k 



•7r(i) 



l<j<k 

and we shafi denote by A^''^(x^, ...,x^) its determinant: 

We then define a map Afe : (E*)^ R<K,k) ^ ^c{N,k) 

(2-2) Afe = {K'X%1 

So the map just associates with a, N x K matrix (a;^) the determinants of 
afi the k X k matrices which can be extracted from it, that is, its fc-cofactors 

Lemma 1. We have: 

n,j '^^^ 

Proof. This is just the Euler identity for fc-homogeneous functions □ 
Lemma 2. Set: 

d 

^(t^'^)" = ^det[a;J] 



We then have: 



det[^(7r,<] = (det[a;-]) 



fe-i 



Proof. We know that (zj)" is if n does not belong to the image of tt, or if j does 
not belong to the image of cr, and that otherwise it is just the cofactor of in 
the matrix (a; J). The last identity then follows from the well-known fact that the 
determinant of a A; x A; matrix raised to the (fc — l)-th power is the determinant of 
the cofactor matrix. □ 



As we stated in the beginning, we are interested in functions of the (a^Jj) which 
involve the A^. We now make this idea precise. Consider the map: 

A = (Ai,...,AK):ii^^-*i? 

if = i?^^ X ... X X ... X 

A function ^ : H R will be called trivial if it depends on the NK first 
coordinates only that is, if it factors through (E*)^"^ . Note that every function 
F : R^^ R can be written F = $0 A, where $ : ^ i? is the identity on {E*)^ 
and sends all the other coordinates to 0. This is called the trivial factorisation. 

Definition 2. A function F : R^^ — > R will be called fc-adapted if it factors 

through Afe , that is, if we have F = <I>oAfe for some function $ : R'^(^''^)'^(^''^) R, 
It is adapted if it factors non-trivially through A, that is, if we have F = $ o A for 
some non-trivial function $ : — > i? such that $ o Aj, 
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We now write the formula for the Legendre transform. Take an adapted function 
F : R^^ R: 

F{x)^F {x\ x^) = F {xi) = $ (A-''^) = $ (A) 
and pair K x N matrices with N x K matrices by: 

<x,y>=J2<y7 

Substitute in the definition (|2.1|) : 




^ "^"^A^ dxi ^ dxi 

\^ , dAl" <J,..^ I « _ V" d-f dAl^ 

k,.,a'^^k dxi fe,.,.^^fe ^a;^" 



k.TT.a- k.TT.a 



) . We rewrite the result in more compact notation: 



r.,..,...„.,^|i:.i^A.-.(A.i»;^i:£S| 

We would like to give general conditions on $ which would ensure that the 
right-hand side is a singleton, so that F has a well-defined Legendre transform. In 
addition, we would like to show that if F is fc-adapted, then F^ is fc-adapted as 
well. Unfortunately, we have not been able to fulfil this program (the calculations 
very quickly become horrendous) so we will be content with two examples. 

3. Functions of the determinant. 

We take N = K. So let x be the square matrix with coefficients x'^. Denote by 
X!^ the cofactor of a;^ in X. We consider functions F : R^ R oi the following 
type: 

F(a;) = $ (detx) 

where $ : i? i? is a function. 

Let us apply the preceding theory. We have: 

(3.1) y^=d^= (det x) -^-j- = 1>' (det x) det X„^ 
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Hence: 

(3.2) ^y^x^ = ci>'(detx)^x:;detX„^ 

71, A; n,k 

(3.3) = N <i>' {det X) det X 

On the other hand, denoting by Y the matrix with coefRcients y^, and by z the 
matrix with coefRcients — detX^, we have: 

(3.4) det Y = ($' (det x)f det z = (det x)^ (det x)^'^ 

Proposition 1. Assume $ is such that the function t — > (t)^ is invertible 

on (a, 6), and let ■0 : {a',b') —^Rbe its inverse. Set Q — {x \ a < detx < b} and 
H = {y \ a' < dety < b'}. Then the function F : Q R given by: 

F{x) = $(deta;) 

has a Legendre transform : E — + i? given by: 

(3.5) F^{y) ^ N ^' {dety)) i}} {del y) - ^ [iP {det y)) 

Proof. Equation (|3.4|l gives detx = -(/'(dety). Writing (|3.3|) and (|3.4|) back into 
formula (|2.3|l . we get: 

Ti (y) = { AT $' (det x) det x - $ (det x) \ y'J = $' (det x) det X'^ } 

= {N {ip (det y)) ijj (det y) - (V' (det y))} 

yielding a unique value. □ 

3.1. Example 1. Take ^ {t) = ^ {tf^^ , with p e i?, so that 

(3.6) F{x) ^ — Idetxl"^^ . 

P 

Note that F is homogeneous of degree p. If p ^ 0, then t^^^^' {t)^ — tP~^, 
which is invertible provided p ^ 1, yielding {s) = s^^^p~^\ Substituting in the 
above, and taking advantage of the fact that $ is homogeneous of degree p/N, we 
get: 

^'^(2/) = (p-l)*(^ (dety)) 

N p 1 

= {p-l) — |dety|f-i " 
P 

Proposition 2. If p ^ and p ^ 1, the function F : R defined by i3.6]} 

has a Legendre transform F^ : R^ — > R defined by: 

N 



F'^{y) = -Idety 



q/N 



with i + i 
p q 



Note that this duality holds for any value of p different from and 1/N, including 
negative ones. Note also that if > 1 (the only interesting case), this duality 
between $ (det X) and "if (det Y) has nothing to do with convexity. On the one 
hand, if p/N > 1, so that $ {t) is convex, then 0<g/A^<l, so that ^' {t) is not 
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convex. On the other hand, it is easy to check that if y =/= 0, we can find a matrix 
X such that J2 ^nUk — ^ ^'^d det a; = 0; it follows that the function 

^ — ' p 

is unbounded from above and from below (consider the sequences Xn = inx), and 
the critical point in the definition of the Legendre transform cannot be a global 
minimum or maximum. 

3.2. Example 2. Take p = 1 in the above, so that $ (i) = Nt'^/^ and: 

(3.7) F{x) = N\dctx\^^^ . 

The function F is defined on the whole of . On the other hand, we have 

f) F 

y"k ^ (x) ^ <i>' (det a:) det 

by (EU, and defy = 1 by lEHI). So F' maps i?^' onto the set ^ = [y \ dety = 1}. 
On the other hand, formula H3.5|) yields quite simply F/, (y) = 0. Hence: 

Proposition 3. The Legendre transform of the function F : — > R given by 
Pri ) IS the function F^ : T, R given by F^ [y) = 

3.3. Example 3. Take $ {t) = In \t\, so that: 

(3.8) F(a;) = ln|deta;| 
Set n^{x\ detx ^ 0}. 

Proposition 4. The function F : Q ^ R defined by has a Legendre transform 
F^ -.n^ R defined by: 

F^ (y) = iV + ln|dety| 

The proof is left to the reader. It follows that the function G [x] = In |det x\+N/2 
is self-dual, i.e. = G 

4. The case of {N - 1)-cofactors 

We shall work with N = K = 3. We presume that similar results hold in the 
general case, but we have not been able to handle the notations. 

Denote by xthe 3x3 matrix with coefficients a;^, with 1 < k < K and 1 < n < iV. 
Denote by the cofactor of x'^ in x, and by its determinant. Set 

Let $ : i?^ ^ i? be given. Consider the function: 

(4.1) F{x) = ^A) 

The formula for the Legendre transform then becomes: 



n,k " n,k " 
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Let us simplify this formula a little bit by setting: 

$t = -—1- (A) 

We then have: 

(4-3) ^HE^}^ = E*}|^(-) 

i,j OX„ 

because, ii k = j or n = i, the variable does not appear in the cofactor X^^. 
If fc 7^ J, we shall denote by p (j, /c) the number in {1, 2, 3} which is different from 
both k and j. Similarly, if n ^ we shall denote by q {i, n) the number in {1, 2, 3} 
which is different from both n and i. \{ k ^ j and i, we have: 



/_.Nm(j,i,fe,n) ( k pU,k) _ k ^p(],k)\ 



where: the exponent m k,n) is ii k > p {j, k) and n > q {i,n), ov if k < p {j, k) 
and n < q{i,n), and m {i, j, k,n) = otherwise. It follows that: 



Q^k - y V 



and hence: 



^ ('_-|^\(fc-p(j,fc))(n-g(i,n)) p(i,fe) 
''k / J \ / j g(i,ra) 

Let us now consider the 3x3 matrix y with coefficients y^, denote by the 
cofactor of and by Z)^ its determinants. Without loss of generality, we can 
assume that n = fc = 1, and we get: 

D\ = ylvl - vlvl 

= {^\xl - ^\x\ - ^\x\ + ^lx\) {^\xl + ^Ixl + ^\xl + ^lx\) - 

{-^\xl - ^\x\ + ^lx\ + ^x\) {-^\xl + ^lx\ - ^Ixl + ^lx\) 

= {^\f {xlxl - xlxl) + ($1$^) {xlx\ - x\xl) + ($1$?) {xlx\ - xlx\) + 

{^\^lxlx\ - ^l^lxlxl) + {^l^X) {-x\xl + xlxl) + {-^l^lx\x\ + ^\^lxlx\) + 

{-^l^lx\x\ + ^l^lxlxX) + {-xlxl + xlxl) + {-^l^\x\x\ + ^lx\^\xl) + 

{-^l^lxlxl + ^lx\^\xX) + {^lx\<^\xl - ^lx\^lx\) + {^lx\^\x\ - ^x\^lx\) + 

{^lx\^\x\ - <^WMx\) + mxl<^lxl - <^l^l<^l^l) 

Lemma 3. // the matrix has rank 1, then the can be expressed in terms of 

the as follows: 

(4.4) 

= n i^l^l + ^l^l + ^lAl + ^Al + ^lAl + ^lAl + ^Al + ^Al) 
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Proof. If the matrix <i>^ has rank 1, aU its 2-cofactors vanish, so that j = 'i'^^fc- 
The previous expression then simphfies: 

Dl = A} + A? + (<i>}$2) Al + 

($}ci>2) A^ + ($1$}) A? + ^3^ 
Ai + ($3$}) A^ + ($3$}) A3 
and $} factors out. □ 

If $ is homogeneous of degree a, the expression 1)4. 4|) simphfies by the Euler 
identity: 

(4.5) = a$^$ = a$ (A) ^ (A) 

and the formula H4.2|l for the Legendre transform of F becomes: 

Tl (y) = {{2a - 1) $ (A) I D^a^ (A) $' (A)} 

Proposition 5. Assume that the function F : ^ R is given by F (x) — ^ (A), 
where $ is homogeneous of degree a, and the matrix (J^/QAjj has rank 1 everywhere. 
Assume that Yj — F' [R^) is a submanifold, and that: 

[Dl = a$ (Ai) $' (Ai) and D2 = a$ (A2) (A2)] => $ (Ai) = $ (A2) 

r/ien F has a Legendre transform F^ : S ^ i? given by 

(4.6) (y) = * (i?) 

where the D^ are the determinants of the 2-cofactors ofy, and [D) = {2a — 1) $ (A) 
for any D such that D ^ a^ (A) $' (A) 

4.1. Example 4. We consider functions F : i?^ — > i? of the following type: 

F {x) ^ ({Y^ Air + (E + (E ^'r] = (a) 

\ n n 71/ 

We have 

^^1-^= f (E ^nT + (E ^")" + (E ' (E ^n)-' 

^ \ n n 71 / n 

SO clearly the matrix has rank 1. The equations (|4.5|1 become: 
Dl = a/3a>^,$ = (c/3)2 /"(^ Ai )" + (E A^)" + (E A^)"] (E ^T'' 

\ 71 71 71/71 

from which we get 

(4.7) Dl = Dl = Dli0Tk=\,2,i 

In other words, the Legendre transform will live on the 3-dimensional subspace 
E of R^ defined by the equations 14.7|l . Setting D]. — Dk for every i, we continue 
the computations: 
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(2/3-1); 



J2w 



{D,)^ = {al3)^ A^)" + Air + (E )" 

\ n n n / 

k \ n n n J 

(EA^r + (EA^r + (EA^rV 



(E^")° 



2a/3-l 



2ag 



Finally, the Legendre transform of F turns out to be the function: 

{y) = {2ap - 1) (a/?)"^ ^E j 

restricted to the 3-dimcnsional subspace S C i?^ defined by the relations D'^ = D^. 
Here, denotes the cofactor of in the matrix Y. 

Note that F is homogeneous of degree 2aP and F^ is homogeneous of degree 
2a/3/ {2a(3 - 1). Setting p = 2q/3 and q = 2a/3/ {2a(3 - 1) , we find that; 

1 1 



1 



p q 



$ ^det 




,det 


x^ x^ 


, det 


1 2 

2^2 *^2 
1 2 


) 




X2 X2 




•^3 '^3 


2^3 ^3 





4.2. Example 5. Let (x^,a;^) be a pair of vectors in R^. We consider functions 
F : ^ R of the following type: 

where \1/ : i?'^ ^ i? is a function. In the previous framework, this can be 
understood as a function F{X), where X is a 3 x 3 matrix, which depends only 
on the first three cofactors. Clearly the rank condition will hold, and the previous 
results apply. It will be more convenient, however, to run through the computations 
again in that particular case, with simplified notations. 
Set A = (A3, A2, Ai) , with: 

— X-^X^ X-^X'2 

A2 = x\x^ — x\x\ 

A\ = X2Xq^ — X2X^ 



We have F {x^^x^) = $ (A). Set = dF/dx^ and compute the cofactors. We 



get: 



y{y2 - 2/22/1 



= yhl - yhl 



9$ 
'dA'2 



A. 



Ai 



9$ 




aAg 


9Ai 


a$ 




aAg 


aA2 







dAi dA2 ) 



Ai + 
A3 + 
A2 + 





9$ 


aA2 


9A3 






dA2 


dAi 







Ao 



Ai 
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We summarize: 
(4.8) £>" : 



dA: 



Ai 



(9$ 



<9$ 
5A7 



1,2,3 



As a particular case, consider the function: 



(4.9) F{x\x^)^ {xlxl - xlxl)" + {x\xl - xjxiy + {xlxl - xlxiy 



When a — 1/ P — 2, this gives the area of the triangle spanned by the vectors x^ 
and x'^. We apply the preceding result, with: 

$ (A3, A2, Ai) = [(Ai)" + (A2)" + (A3)"]'' 
The system H4.8f) becomes: 



D" = [apf [(Ai)" + (A2)" + (A3) 



(A, 



and can easily be inverted (note that ifa = l//3 = 2, we get the identity). We get: 



Substituting into formula H4.6I) , and taking advantage of the fact that $ is homo- 
geneous of degree a(3, we get the Legendre transform: 

(yi, ys) = (2«/3 - 1) [(Ai)" + (A2)" + (A3)"]'' 



{2al3 - l){al3)- 



2a(3-l ■ 



ivlvl - ylvl) + (yl 



Note that if a = 1//3 = 2, we find F = F^: the function F is its own Legendre 
transform. Note also that F is homogeneous of degree 2q;/3 and homogeneous 
of degree 2aP/ {2aP — 1) . Setting p — 2a(3 and q = 2a/3/ (2a/3 — 1) , we find that; 

1 1 



= 1 



P q 



as before. 



5. A VARIATIONAL PROBLEM. 

As a example of possible application of this kind of duality, let us consider the 
following problem. Given a positive definite quadratic form {Ax, x) on and a 

N X iV-matrix /, we want to solve $' (x) = 0, where:: 



1 



$(a:) = -{Ax,x) 



N 



|detx 



p/N 



2 ■ ■ P 

Such points are called critical points of <f>. Any critical point of F solves the 
system: 

Ax^\detx\''/^-^X + f 
where X is the matrice of cof actors of x. 



Proposition 6. If p < 2, there is at least one critical point for $. 
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Proof. Since p < 2, the function $ is coercive: $ (x) — > oo when ||a;|| —^ oo. So it 
attains its minimum at some x, which has to be a critical point. □ 

We now use duahty theory to treat the case p > 2. We shaU use an extension of 
the Clarke duality formula (see |2] ): 

Proposition 7. Suppose F (x) has a Legendre transform (y). Consider the 
functions $ and 4* defined by: 

^{x)^^iAx-f,x)~Fix) 

^{y)^^iAy + f,y)-F'^ (Ay) 
If y is a critical point 0/^*, then x = y + A^^ f is a critical point of 

Proof. If 1/ is a critical point of 'f, we have (y) — 0, and hence Ay + / = 
A (F^y (Ay). Since A is invertible, it follows that y + A^^ f = (F^)' (Ay). Since 

(F^) = F', it follows that 

F' {y + A-\f) =Ay = A{y + A-'f) - / 
so that F' (x) = Ax — /, as desired □ 
Proposition 8. If p > 2, there is at least one non-trivial critical point for <f>. 
Proof. Consider the function: 

^{y)^\{Ay + f,y)-- \dei Ay f"" 

By proposition El the function 4* has a critical point y ^ Q. By proposition it is 
also a critical point of $ □ 

Note that in this case inf $ = —00, so that the critical point x cannot be a 
minimize! . 
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